Fusion for Visual Recognition

نویسندگان

  • Liangliang Cao
  • Mark A. Hasegawa-Johnson
  • Jiawei Han
چکیده

In the past decade, the popularity of the Internet and digital cameras has led to a flourishing of images and videos. Surveillance videos are increasing explosively with the huge amounts of surveillance cameras. Compared with traditional datasets in computer vision, which host only thousands of images, these largescale datasets in the era of the Internet have grown beyond the wildest imagination, and posed a serious challenge for visual recognition and detection. To handle the challenge of visual recognition in complicated scenarios, we that a single feature is not enough to distinguish web-scale visual concepts. Accordingly, this dissertation proposes to combine heterogeneous features for different visual recognition tasks. We first develop a machinery called Heterogeneous Feature Machines to effectively fuse multiple types of visual features. In addition, we realize that in specific applications such as consumer photo annotation or surveillance action detection, there are also specific cues which are helpful for visual recognition tasks. We consider three scenarios: (1) consumer photo recognition, where we explore the use of metadata such as time and GPS, (2) Web image searching and annotation, where we combine both user tags and network information for visual applications, and (3) action detection in videos, where the spatial-temporal coherence is combined with multiple visual features for detection tasks. We believe heterogeneous feature fusion is useful in a wide range of applications and merits research efforts in this promising direction.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fusion Framework for Emotional Electrocardiogram and Galvanic Skin Response Recognition: Applying Wavelet Transform

Introduction To extract and combine information from different modalities, fusion techniques are commonly applied to promote system performance. In this study, we aimed to examine the effectiveness of fusion techniques in emotion recognition. Materials and Methods Electrocardiogram (ECG) and galvanic skin responses (GSR) of 11 healthy female students (mean age: 22.73±1.68 years) were collected ...

متن کامل

Multi-Focus Image Fusion in DCT Domain using Variance and Energy of Laplacian and Correlation Coefficient for Visual Sensor Networks

The purpose of multi-focus image fusion is gathering the essential information and the focused parts from the input multi-focus images into a single image. These multi-focus images are captured with different depths of focus of cameras. A lot of multi-focus image fusion techniques have been introduced using considering the focus measurement in the spatial domain. However, the multi-focus image ...

متن کامل

Fuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition

 In this paper, utilization of clustering algorithms for data fusion in decision level is proposed. The results of automatic isolated word recognition, which are derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined with each other by using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...

متن کامل

Urban Vegetation Recognition Based on the Decision Level Fusion of Hyperspectral and Lidar Data

Introduction: Information about vegetation cover and their health has always been interesting to ecologists due to its importance in terms of habitat, energy production and other important characteristics of plants on the earth planet. Nowadays, developments in remote sensing technologies caused more remotely sensed data accessible to researchers. The combination of these data improves the obje...

متن کامل

Application of Combined Local Object Based Features and Cluster Fusion for the Behaviors Recognition and Detection of Abnormal Behaviors

In this paper, we propose a novel framework for behaviors recognition and detection of certain types of abnormal behaviors, capable of achieving high detection rates on a variety of real-life scenes. The new proposed approach here is a combination of the location based methods and the object based ones. First, a novel approach is formulated to use optical flow and binary motion video as the loc...

متن کامل

A Survey on Different Fusion Techniques of Visual and Thermal Images for Human Face Recognition

In this paper we do a survey on different fusion techniques of visual and thermal images for human face recognition. Image fusion constructs a single image by combining information from a set of source images together using different techniques. It is quite simpler to extract and locate facial features in visual images. Another advantage of using visual image is that it works well under control...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011